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ABSTRACT 



A method of enhancing the audio quality in a reproduction 
medium having unknown characteristics. With this method 
a predetermined finite set of single frequency tones are 
generated and these tones are then passed through the 
reproduction medium to generate an output signal, which in 
turn is passed through a set of sub-band filters. Each of the 
sub-band filters pass at least a frequency corresponding to 
one of the tones in the set of tones. The characteristics of the 
reproduction medium is then estimated as a result of passing 
the output signal through the set of sub-band filters. Based 
on the estimated characteristics of the reproduction medium, 
a set of sub-band inverse filters are constructed. Finally 
before passing the audio signal through the _reproduction 
medium the signal is passed through the set of inverse filters - 
to improve the quality of the audio signal after it passes 
through the reproduction medium. 



3 Claims, 3 Drawing Sheets 
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ACOUSTIC QUALITY ENHANCEMENT VIA 
FEEDBACK AND EQUALIZATION FOR 
MOBILE MULTIMEDIA SYSTEMS 

TECHNICAL FIELD 

The invention relates to the audio reproduction where the 
quality of the acoustic source is affected by unknown and 
possibly time-varying characteristics of the reproduction 
equipment and the environment, and, more particularly 
relates to the audio reproduction in mobile multimedia' 
systems where the low-cost speakers and the constantly 
changing environment introduce distortions to audio signals. 

DESCRIPTION OF THE PRIOR ART 

Audio reproduction in a mobile multimedia system often 
surfers from distortions introduced by poor quality speakers 
and environmental fluctuations. 

The subject of audio quality enhancement has been 
researched in considerable detail over the years. The articles 
entitled "Digital Equalization of Room Acoustics" by J N 
Mourjopoulos in the Journal of the Audio Engineering 
Society, Vol. 42, No. 11, pp. 884-900 (November 1994) and 
"Digital Car Audio Systems" by J. Kontro in IEEE Trans- 
actions on Consumer Electronics, Vol. 39 No 3 pp 
514-521 (August 1993), and the references contained 
therein provide some relevant background. The idea of using 
feedback of the audio source, modeling the reproduction 
medium as a filter, and inverse filtering (equalizing) the 
effects of the. reproduction medium is central to most of 
these approaches. The mechanisms for estimation of the 
medium, and for equalization vary considerably. The afore- 
mentioned Mourjopoulos article studies the problems 
encountered in using inverse filters; Primarily, since the 
impulse response of the reproduction medium tends to be 
long, the length of an inverse filter is also long, leading to 
computationally intensive algorithms. Further, a number of 
algorithms for implementing inverse filters tend to be 
unstable. The aforementioned Mourjopoulos article presents 
a method where the length of the inverse filter is shortened 
by using all-pole modeling and vector quantization of 
responses of the reproduction medium. The aforementioned 
Kontro article describes an audio system using an equalizer 
for gam control and for compensating for the medium's 
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This invention proposes to estimate the characteristics of 
the reproduction medium using a training signal consisting 
ot a set of pure frequency tones generated solely for the 
purpose of training, which also satisfies the low-complexity 
and short delay requirements described above, since the 
proposed filters that equalize the characteristics of the repro- 
duction medium have short lengths and the filter coefficients 
may be calculated with minimal complexity due to the 
simplicity of the training signal. Furthermore, this invention 
addresses the problem of acoustic quality enhancement in a 
dynamic environment, as opposed to the static environments 
considered in the prior art, since we propose to use the 
existing microphone and speakers, which form integral 
components of a mobile multimedia system. Thus the 
process of estimating and compensating for the undesirable 
characteristics of the reproduction medium may be done 
adaptively and repeatedly as deemed necessary. 

Consider an audio source, amplified and then reproduced 
through a set of speakers. A microphone is used to feed back 
the reproduced audio source, into a processing mechanism 
This processing mechanism in turn, controls subsequent 
audio reproduction. The processing mechanism may operate 
in two phases. In the first phase, which is the training phase, 
the medium's characteristics will be estimated, and a set of 
filters is constructed, with fixed parameters. The set of filters 
will subsequently pre-filter the audio source, in order to 
equalize for the medium's characteristics, during the second 
phase which is the processing phase. If necessary, the 
pre-filter parameters may be updated by feedback of the 
reproduced audio source, even after the initial training 
period. 6 

According to this invention, during the training phase" 
umque frequency tones are transmitted (e.g., via speakers)' 
and then recorded (e.g., via a microphone). Each fed-back 
audio frequency tone is then used to estimate the gain of the 
reproduction medium at that particular frequency, and the 
background noise parameters at that frequency are also 
determined. This invention is used to construct a set of 
mverse filters, so the original audio source can then be 
pre-filtered to produce the desired audio output. 

During the second phase, which is the processing phase 
for playing back an- 7 audio source, the -audio source is 
decomposed into sub-bands whose center frequencies are 
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intensive, and is not intended for adaptive use. Once the audio sigtfal component is pis-emphasized by the gain 



medium's frequency response is measured, the equalizer 
parameters are fixed. This approach is reasonably good, but 
only for static environments, and in addition, it is quite 
computationally complex. 

SUMMARY OF THE INVENTION 
The invention addresses the problem of acoustic quality 
enhancement for such and similar systems, where the sub- 
jective quality of the audio source is affected by unknown 
and possibly time-varying characteristics of the reproduc- 
tion equipment and the environment. The invention presup- 
poses that the computational complexity of the proposed 
solution must be kept to a minimum because mobile systems 
have limited resources, and that the solution should not 
result in excessive delays in audio source reproduction. The 
invention provides a means for estimating and compensating 
tor the undesirable characteristics while minimizing both the 
computational complexity and the delay in audio source 
reproduction as required, and allow subsequent reproduction 
of an audio source that is better matched to the intended 
audio output. 
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estimates obtained during training, and also inverse filtered 
using the parameter-estimates obtai ned during training. The'' 
resulting signal is then reconstructed into a full-band signal, 
50 resulting m an actual audio output signal that is better 
matched to the intended audio output. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 schematically illustrates the overall system in 
55 accordance with the invention. 

FIG. 2 is a more detailed schematic of the system used in 
this invention. 

FIG. 3 is a more detailed schematic of the filtering unit. 
60 FIG. 4 is a schematic of the sub-band inverse filter. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENT 

FIG. 1 illustrates the overall system of the invention 
55 Shown is computer 110, speakers 120 and microphone 130 
FIG. 2 is a more detailed schematic of system 100. . Com- 
puter 110 co mprises the audio data source 140, the filtering 
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unit 200 and the training unit 400. Also shown in FIG 2 is 
the reproduction medium 300, which includes speakers 120. 
/""TIG. 3 describes the filtering unit 200, which is included 
' m computer 110. This unit is used for processing the audio 
signal in order to compensate for the effects of the repro- 
duction medium, which includes the speakers and the envi- 
ronment in which the system is operating. Unit 210 is a 
sub-sampling and decimation process. Unit 220 is the sub- 
band inverse filter, and unit 230 is the up-sampling or 
interpolation process. Unit 240 is an additional stage, where 
signals from various interpolation stages 230 are added 
together to form the desired audio output signal. 

The preferred embodiment consists of two phases. The 
first phase is the training phase, and the second phase is the 
processing phase. 

Again, referring to FIG. 2, the training phase is the first 
phase of the implementation. The audio signal produced by 
unit 110 is reproduced through the speaker units 120 The 
audio signal travels through the reproduction medium 
which comprises the speakers 120 and the environment' 
(-During the twining phase, a unique set of freq uency tones is 
\ generated by the training unit 400, and reproduced by the 
J speakers 120. The training signal shall comprise at least one 
/ frequency tone in each of the M frequency sub-bands that 
V collect.vely span the range of frequencies that comprise all 
Vudio signals generated by audio data source 140 The 
Selection of an appropriate value for M and the values for M 
/frequency sub-bands may be done using guidelines for 
/sub-band coding of speech and audio signals, such as those 
/ „ eS t I ? bed m " Speech Codin 8 and Synthesis", edited by W 
J B Klein and K. K. Paliwal (Elsevier, 1995), and incorpb- 
/ rated herein by reference. The audio signal thus reproduced 
J by speakers 120 is received and digitally recorded by 
/ microphone 130. The digitized signal is separated into M 
frequency sub-bands, using standard sub-band filtering tech- 
niques such as those described in the aforementioned Klein 
e al reference, and incorporated herein by reference. The' 
filtered signal is then used to estimate the parameters of the 

^ht^H m Z etSe filterS 220 (See nG - 3 >> ^"g stan <Jard 
sub-band filter estimation procedures, such as those 
described in Adaptive Filter Theory-Third Edition" by S 
Haykin (Prentice-Hall, 1996) and The aforementioned 
Klein, et al reference, and incorporated herein by reference 
Once the estimation of the filter parameters of the sub-band 
inverse filters is done, the training phase is completed The 
training phase may be invoked whenever additional tuning 
of the sub-band filters arc desired, such as when there is a 
'cnange in the environment, of at regular intervals. 

Again referring to FIG. 2, once the training phase is 
complete the processing phase may be used to improve the 
quality of any digitized audio signal to be reproduced by 
reproduction medium 300. The sub-band inverse filters 220 
may be implemented as a transversal filter. Construction of 
transversal filters may be done as described in The afore- 
mentioned Klein, et al reference, and incorporated herein by 
reference. (See FIG. 3.) The audio signal to be reproduced 
is first passed through unit 210 for sub-sampling and 
decimation, filtered by sub-band inverse filters 220 
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up-sampled or interpolated by unit 230, and added together 
by unit 240. The processed audio signal is sent to speakers 
120 tor reproduction. 

FIG. 4 illustrates the detailed implementation of the 
sub-band inverse filter 220. The filter parameters to be 
estimated during the training phase are the coefficients c'(0) 

• ■ • c(N-l) for each of the M sub-band filters, where ill 

• • • , M. The input to filter is x'(n) which is one of the M 
sub-band components of the audio source signal X(n) 
Shown also are N delay elements where N is the length of 
he filter. N varies with the performance requirements and 
he processing power of computer 110. At each sampling of 
the source signal X(n), the components x'(n), x'(n-l) 

mu ' ti P lied b y corresponding coefficients 

c(0), c(l) c'(N-l). The products are then added by 

accumulator 221 to form the output component xYn) The 
above is repeated for each of the M sub-bands, and the 
output components x'(n) for i=l, . . ., M, to form the final 
output signal which is sent to the reproduction medium to be 
played out. 

Having thus described our invention, what we claim as 
new and desire to secure by Letters Patent is: 

1. A method of rapidly enhancing audio quality of an input 
audio signal in a portable computing system having limited 
resources and having a reproduction medium with unknown 
characteristics, said method comprising: 

a. generating a predetermined finite set of M sinele 
frequency tones; 

b. passing said set of tones through said reproduction 
medium to generate a subsequent output signal; 

c. passing said subsequent output signal through a' set of 
sub-band filters, each of the sub-band filters passing at 
least a frequency corresponding to one of the M tones; 

d. estimating the unknown characteristics of said repro- 
duction medium by examining outputs of each of said 
sub-band filters after passing said subsequent output 
signal through said medium to produce gain estimates- 

e. dynamically constructing a set of sub-band inverse 
filters to compensate for the estimated characteristics of 
the reproduction medium; and 

f. before passing an input audio signal through said 
reproduction medium, passing said audio signal 
through said inverse filters, thereby improving the 
audio quality after the audio signal passes through said 
reproduction medium. 

2. The method of claim 1 further comprising the steps of: 
decomposing the audio signal into frequency sub-bands 

prior to passing the audio signal through the inverse 
filters; and 

reconstructing an output audio signal from the filtered 
frequency sub-bands. 

3. The method of claim 2 further comprising pre- 
emphasizing the signal after said decomposing based on the 
gain estimates. 
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